Internship Report Compositions of Extended Top-down Tree Transducers
نویسندگان
چکیده
Many aspects of machine translation of natural languages can be formalized by employing weighted finite-state (string) transducers [22, 40]. Successful implementations based on this wordor phrasebased approach are, for example, the At&t Fsm toolkit [41], Xerox’s finite-state calculus [24], the Rwth toolkit [23], Carmel [19], and OpenFst [2]. However, the phrase-based approach is not expressive enough, for example, to easily handle the rotation needed in the translation of the English structure NP-V-NP (subject-verb-noun phrase) to the Arabic structure V-NP-NP. A finitestate transducer can only implement this rotation by storing the subject, which might be very long, in its finite memory. Syntax-based (or tree-based) formalisms can remedy this shortage. An example of such formalisms is the top-down tree transducer [42, 43], of which a weighted version is implemented in the toolkit Tiburon [38], together with some standard operations. Those weighted top-down tree transducers [29, 14, 17] (also called ‘tree series transducers’) are a joint generalization of the unweighted top-down tree transducer (tdtt) [42, 43] and the weighted tree automaton [7, 10, 1, 27, 16, 9, 8]. During my internship, I investigated compositions of weighted and unweighted extended topdown tree transducers. An unweighted tree transducer computes a relation τ between input and output trees, and a weighted tree transducer computes a weighted relation between input and output trees (i.e., it assigns a weight to each pair of input and output trees). An unweighted tree transducer can be seen as a weighted tree transducer, where the weighted relation assigns true (resp. false) to a pair of trees if this pair is (resp. is not) in the relation τ . Using the real numbers as weight structure, we compose two weighted relations τ1 : A × B → IR and τ2 : B × C → IR by requiring that (τ1 ; τ2)(a, c) = ∑
منابع مشابه
Compositions of Extended Top-down Tree Transducers
Unfortunately, the class of transformations computed by linear extended top-down tree transducers with regular look-ahead is not closed under composition. It is shown that the class of transformations computed by certain linear bimorphisms coincides with the previously mentioned class. Moreover, it is demonstrated that every linear epsilon-free extended top-down tree transducer with regular loo...
متن کاملComposition Closure of ε-Free Linear Extended Top-Down Tree Transducers
The expressive power of compositions of linear extended topdown tree transducers with and without regular look-ahead is investigated. In particular, the restrictions of ε-freeness, strictness, and nondeletion are considered. The composition hierarchy is finite for all ε-free variants of these transducers except for ε-free nondeleting linear extended top-down tree transducers. The least number o...
متن کاملSurvey: Weighted Extended Top-Down Tree Transducers Part III - Composition
In this survey (functional) compositions of weighted tree transformations computable by weighted extended top-down tree transducers are investigated. The existing results in the literature are explained and illustrated. It is argued, why certain compositions are not possible in the general case, and 3 informed conjectures provide an insight into potentially 3 new composition results that extend...
متن کاملCompositions of Top-down Tree Transducers with "-rules
Top-down tree transducers with "-rules ("tdtt) are a restricted version of extended top-down tree transducers. They are implemented in the framework Tiburon and ful ll some criteria desirable in a machine translation model. However, they compute a class of transformations that is not closed under composition (not even for linear and nondeleting "tdtt). A composition construction that composes "...
متن کاملExtended Multi Bottom-Up Tree Transducers Composition and Decomposition
Extended multi bottom-up tree transducers are de ned and investigated. They are an extension of multi bottom-up tree transducers by arbitrary, not just shallow, left-hand sides of rules; this includes rules that do not consume input. It is shown that such transducers, even linear ones, can compute all transformations that are computed by linear extended top-down tree transducers, which are a th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011